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(54) Apparatus for adapting the directional characteristic of microphones for voice control 



(57) Voice control systems are used in diverse tech- 
nical fields. In this case, the spoken words are firstly 
detected, usually by one or more microphones (M1 , M2) 
and then fed to a speech recognition system (REC), 
which determines from the detected signals those 
words having the highest probability. The robustness 
with respect to acoustic interfering influences can be 
increased by the microphone being fastened or held 
directly in front of the speaker's mouth. The invention 
enables voice control even from a relatively large dis- 

V1 



tance by virtue of the fact that interfering background 
noises are spatially separated by the use of directional 
microphones or microphone arrays (M1, M2). In this 
case, the directional characteristic is adapted by virtue 
of the fact that means for detecting the whereabouts of 
the user (IR1 , VR, CtR) are provided. One or more infra- 
red motion detectors are preferably used for this pur- 
pose. 



IR 



V2 



V3 




PROC 




REC 




CSYS 







< 

CO 

m 

CO 

o 



LU 



Fig. 1 



1 



EP 1 085 781 A2 



2 



Description 

[0001] The invention relates to an apparatus for 
adapting the directional characteristic of microphones 
for voice control, which can be utilized in particular for 5 
controlling appliances appertaining to consumer elec- 
tronics. 

Prior ar t 

[0002] Voice control systems are used in a multi- 
plicity of technical fields. In this case, the spoken words 
are firstly detected as sound signals, usually by one or 
more microphones, and then fed to a speech recogni- 
tion system. In this case, the speech recognition is usu- 
ally based on an acoustic model and a speech model. 
The acoustic model utilizes a large number of speech 
patterns, mathematic algorithms being used to indicate 
the words which acoustically best match a spoken word. 
The speech model in turn is based on an analysis which 
uses a multiplicity of document samples to ascertain the 
context in which, and how often, certain words are nor- 
mally used. Such speech recognition systems make it 
possible to recognize not only individual words but also 
fluently spoken sentences with high recognition rates. 
However, the recognition rate drops drastically when 
non-negligible background noises are present. 
[0003] The robustness with respect to such acous- 
tic interfering influences can be increased in various 
ways. Thus, in dictation systems for computers, a micro- 
phone on a headset frame is fastened directly in front of 
the speakers mouth. In these systems, a very constant 
signal and hence an, in some instances, appreciable 
recognition rate can be achieved only by the direct prox- 
imity to the mouth. It is likewise known to control a tele- 
vision set by speaking the operational commands into 
the microphone which is integrated in a remote control. 
However, the remote control must be held directly in 
front of the user's mouth in this case as well. 

Invention 

[0004] The invention is based on the object of spec- 
ifying an apparatus for voice control which enables suf- 
ficient interference immunity even in the event of voice 
input from a relatively great distance. This object is 
achieved by means of the apparatus specified in Claim 
1. 

[0005] In order to enable voice control even from a 
relatively great distance, the voice signal must be sepa- 
rated from interfering background signals. This can be 
effected by spatial separation using directional micro- 
phones or microphone arrays comprising two or more 
microphones. However, if the intention is to allow the 
user to move freely in the room during the voice input, 
instead of the said user being restricted to a limited 
area, e.g. centrally in front of the appliance, then the 
directional characteristic of these microphone arrays 



must be adapted. The microphone signals themselves 
could be used for this adaptation, in this case, the pres- 
ence of a corresponding signal must firstly be detected, 
and then the microphone array can be correspondingly 
adapted, and only afterwards can satisfactory speech 
recognition begin, However, this process sequence may 
require a number of seconds and thus means that the 
first syllables or even words cannot be recognized. 
[0006] In principle, the invention's apparatus for 
voice-activated remote control comprises one or more 
microphones for converting voice commands into elec- 
trical signals and a speech recognition unit for convert- 
ing these electrical signals into operational commands, 
and also means for detecting the whereabouts of the 
user, which yield a signal which can be used for adapt- 
ing the directional characteristic of the microphone or 
microphones to the whereabouts of the user 
[0007] The use of a separate detection means has 
the advantage that fast detection of the whereabouts of 
the user also always enables the first syllables or words 
of a command to be detected and recognized. In princi- 
ple, any sufficiently fast detection means is suitable for 
this purpose, 

[0008] The means for detecting the whereabouts of 
the user advantageously have one or more infrared 
motion detectors. 

[0009] Likewise, it may be advantageous if the 
means for detecting the whereabouts of the user have 
electrical contacts in addition to the infrared motion 
detectors or Instead of the infrared motion detectors. 
[0010] Preferably, a directional microphone may 
also be provided which is oriented to the whereabouts 
of the user. 

[0011] The apparatus according to the invention is 
particularly advantageously integrated in a consumer 
electronics appliance. 

Drawing 

[0012] An exemplary embodiment of the invention 
is described with reference to the drawing. 
[001 Z] The said drawing shows an arrangement for 
voice control according to the invention. 

Exemplary embodiments 

[0014] Fig, 1 schematically illustrates an apparatus 
according to the invention as it may be integrated e.g. in 
a television set, video recorder or DVD player. In order 
to detect the position of a user before the latter has 
issued a use command, an infrared position detector is 
provided. The latter is based on the principle that the 
intensity of the infrared radiation radiated by an object or 
a person depends on- the respective temperature. The 
infrared position detector comprises a multiplicity of 
infrared detectors IR, which are known to the person 
skilled in the art and are not, therefore, described in any 
further detail, since a single detector can determine only 
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the presence or motion of a user, but not position. 
Depending on the requirements made of the resolution, 
the arrangements may involve a plurality of individual 
detectors or an array of the kind also used for infrared or 
thermal image cameras. In this case, the observation 
range can be influenced by a lens or lens combination 
(not illustrated). The electrical signals obtained from the 
infrared radiation by the infrared detectors are then 
amplified by an amplifier V3 and pass to a control unit 
CIR, which determines from the detected signals the 
whereabouts and, if appropriate, also the direction of 
motion of the user. 

[0015] For the detection of the voice signals, a 
microphone array comprising two microphones M1 and 
M2 is provided. The latter convert the detected sound 
signals into electrical signals which are firstly amplified 
by amplifiers V1 and V2 and are then fed to a signal 
processing unit PROC, The latter then takes account of 
the respective whereabouts of the user by means of dif- 
ferent scaling or processing of the detected sound sig- 
nals. The signal processed in this way is then fed to a 
speech recognition unit REC, which converts the electri- 
cal signals into words. Finally, the commands corre- 
sponding to these words are then fed to a system 
manager CSYS for controlling the system, 
[001 6] In a microphone array comprising two micro- 
phones, the two microphones may advantageously be 
accommodated on the left-hand and right-hand sides of 
the housing of the respective appliance. However, the 
microphone array is in no way restricted to two micro- 
phones, but rather may also have more than two micro- 
phones. Likewise, it is also possible to use a directional 
microphone having a pronounced directional character- 
istic, which is then rotated depending on the detected 
whereabouts of the user. 

[0017] It is advantageous that, in a basic setting, it 
is possible to implement an orientation in the direction of 
the entrance area of the room in which the appliance is 
situated. The appliance is thus immediately ready for 
use as soon as a user enters the room. 
[0018] If a plurality of possible users are detected, 
then various procedures are conceivable. By way of 
example, the directional characteristic can always be 
adapted to the user detected first, so that the micro- 
phone array as it were follows this user. Likewise, the 
microphone array may also follow that user who per- 
forms a voice input first. Furthermore, it is also possible 
to effect a regular changeover of the orientation 
between the various possible users. Finally, the size of 
the detected objects can be evaluated in order, in this 
way, to prevent an orientation to the user's pets from 
being carried out. 

[0019] Furthermore, it is conceivable to detect the 
user entering the room by means of contacts on the 
door or on the light switches, instead of by means of the 
infrared detector. Finally, a combination of such con- 
tacts with infrared detection is also possible. 
[0020] In accordance with a further embodiment, 



the detection means may be continuously activated with 
appliances switched off. if a user is then detected, then 
firstly only the directional characteristic of the micro- 
phones is adapted and the speech recognition unit is 

5 switched on. If a use command, in particular a com- 
mand for switching on the appliance to be controlled, 
then ensues, the remaining functional groups of the 
appliance are also activated. In this way, the appliance 
is continuously ready for use with merely a low power 

io output demand of the appliance, 

[0021] The invention can be used for the voice-acti- 
vated remote control of a wide variety of appliances 
appertaining to consumer electronics, such as e.g. of 
TV sets, video recorders, DVD players, satellite receiv- 

15 ers, TV/video combinations, audio devices or complete 
audio systems, likewise of personal computers or of 
domestic appliances. 

Claims 

20 

1. Apparatus for voice-activated remote control, hav- 
ing one or more microphones (M1, M2) for convert- 
ing voice commands into electrical signals and 
having a speech recognition unit (REC) for convert- 

25 ing these electrical signals into operational com- 
mands, characterized in that means for detecting 
the whereabouts of the user (IR, V3, CIR) are pro- 
vided, which yield a signal which can be used for 
adapting the directional characteristic of the rnicro- 

30 phone or microphones to the whereabouts of the 
user. 

2. Apparatus according to Claim 1 , characterized in 
that the means for detecting the whereabouts of the 

35 user have one or more infrared motion detectors, 

3. Apparatus according to Claim 1 or 2, character* 
ized in that the means for detecting the wherea- 
bouts of the user have electrical contacts. 

40 

4* Apparatus according to one of the preceding 
claims, characterized in that a directional micro- 
phone is provided which is oriented to the wherea- 
bouts of the user. 

45 

5, Consumer electronics appliance having an appara- 
tus according to one of the preceding claims. 
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